Topic recognition - s4647936 #165

tranvicky · 2023-10-25T13:34:05Z

This pull request introduces an Alzheimer's Disease classification solution using Siamese networks on the ADNI dataset. The Siamese Network, with a Simple Classifier, allows for the differentiation between Alzheimer's Disease (AD) and Normal Control (NC) brain images. The primary objective is to use triplet loss and the power of Siamese networks to achieve this classification.

Key Highlights:

Implementation of a Siamese network paired with a simple classifier for AD vs. NC classification.
The use of the TripletDataset for generating triplets of anchor, positive, and negative images for training.
Data augmentation techniques to enhance model robustness during training.
Visualisation of embeddings post-training using t-SNE.
In-depth analysis using confusion matrices to evaluate the performance of the classifier on test embeddings.

Files Added:

train.py: Script containing the Siamese network training logic, visualisation of results, and classifier training.
modules.py: Contains the architecture for the Feature Extractor, Siamese Network, Triplet Loss, and the Simple Classifier.
dataset.py: Provides the TripletDataset class for generating triplets and the logic for a patient-wise dataset split.
predict.py: Script to generate embeddings using the trained Siamese Network, visualise them, and evaluate the classifier's performance.
Various plots and figures showcasing the t-SNE visualisations, training vs. validation loss, and confusion matrices.

Setup and Requirements:

Requires PyTorch 1.10.1+cu113 or above.
Dependencies include torchvision, Pillow, matplotlib, seaborn, numpy, and scikit-learn.
Detailed setup, installation, and usage instructions can be found in the README.

Thanks,
Vicky (s4647936)

…ernAnalysis-2023 into topic-recognition

…function in modules.py.

… patient), and negative (different class) image triplets for Siamese training.

…d anchor selection and optimized triplet formation for Siamese network training.

…n train.py

…hecking for NaN and Inf values in the dataset

… data, improved triplet selection using patient IDs, and streamlined path extraction using os.path.basename

…ernAnalysis-2023 into topic-recognition

… flip, and vertical flip) when in train mode

…dded visualization of sample images post-training.

…ning

…rform predictions on new sample data.

…nd added functionality to save the 2D scatter plot as embeddings_pca.png.

…taset in train.py This will enable subsequent classifier training on Siamese network embeddings.

… and use of labels when processing embeddings post Siamese network training.

…te the classification of Siamese network embeddings into AD or NC categories

…ed embeddings from the Siamese Network. Utilized the Adam optimizer and CrossEntropyLoss for training.

…dings. The accuracy metric is computed to assess the classifier's performance on the test set.

…sualise.

…n losses.

…lidation losses. This enhancement aids in monitoring the classifier's performance across epochs.

…ve for 5 consecutive epochs).

…in classification evaluation.

…incorrect embedding sizing.

…layer

…ers.

…ning and also added results summary.

… can now just run the file. Updated with saving all visualising plots.

…or users.

…aset.py.

…d of images and to display images properly.

nathasha-naranpanawa · 2023-11-06T00:40:19Z

This is an initial inspection, no action is required at this point

Difficulty: Hard

Readme: Very good

Clear flow of information
Algortihm and usage explained well
Relevant plots for loss and 2D manifolds provided

Commit messages: very good, detailed

Code:

uses a triplet network
good design
sufficiently commented

Functionality/Performance:

the triplet network seem to have converged but the manifold doesn't look great
classifier seem to perform poorly (no accuracy plot/average accuracy for the classfier shown other than the confusion matrix)
future work have been suggested but no attempts have been made to improve either sub networks

General comments:

PR is in the main branch
solves the problem apporpriately although performance is poor
one major reason the triplet isn't doing a good job of separating the two classes could be because your feature extracting backbone network isn't strong enough. Could have tried something like a ResNet or VGG to see if it would improve.

shakes76 · 2023-11-20T23:10:51Z

Marking

Good Practice (Design/Commenting, TF/Torch Usage)

Adequate design and implementation
Good spacing and comments
Header blocks missing -1

Recognition Problem

Solves problem
Driver Script present
File structure present
Shows Usage & Demo & Visualisation & Data usage
Module present
Commenting
No Data leakage
Difficulty: Hard

Commit Log

Meaningful commit messages
Progressive commits used

Documentation

ReadMe acceptable/good
Model/technical explanation
Good Description and Comments
Markdown used and PDF submitted

Pull Request

Successful Pull Request (Working Algorithm Delivered on Time in InCorrect Branch) -2
Feedback required, please change the branch to the correct one, ensure repo READMEs are restored. -2
Request Description good

shakes76 · 2023-11-20T23:10:59Z

Feedback marks possible +2 if the requested changes are made (see above).

wangzhaomxy · 2023-11-21T07:19:48Z

No feedback attempt and no feedback marks granted.

shakes76 and others added 30 commits September 17, 2023 21:47

Added recognition branch and README for info.

c3aff8b

Initial setup for Siamese network Alzheimer’s disease classification.

7e5bc0e

Renamed README.MD to README.md

8d8b7f2

Update to contain rough outline for README.md

9aabb12

Initial draft of Siamese network architecture in modules.py

7e03b36

Added section in code to help determine dataset image dimensions

7d27279

Merge branch 'topic-recognition' of https://github.com/tranvicky/Patt…

7cacc40

…ernAnalysis-2023 into topic-recognition

•TripletLoss setup: Create an initial structure for the triplet loss …

dea52f6

…function in modules.py.

Added TripletDataset to dataset.py to generate anchor, positive (same…

cbb5bea

… patient), and negative (different class) image triplets for Siamese training.

Refactored TripletDataset for enhanced adaptability, ensuring balance…

e096a1e

…d anchor selection and optimized triplet formation for Siamese network training.

Added image transformations and initialized train and test datasets i…

0628f8c

…n train.py

Added functionality to display and save sample triplet images while c…

3387cc5

…hecking for NaN and Inf values in the dataset

Refactored TripletDataset to ensure patient-wise split for train-test…

0d39d54

… data, improved triplet selection using patient IDs, and streamlined path extraction using os.path.basename

Added testing to determine size of training sets after splitting

33b1d47

Merge branch 'topic-recognition' of https://github.com/tranvicky/Patt…

e5b4344

…ernAnalysis-2023 into topic-recognition

Added additional data augmentation steps (random rotation, horizontal…

b04d801

… flip, and vertical flip) when in train mode

Integrated the training loop for the Siamese Network into train.py. A…

cc2038f

…dded visualization of sample images post-training.

Fixed method save_image in train.y so that it properly saves images.

a1058d9

Added validation/testing loop to evaluate model on test set post-trai…

22adb4c

…ning

Implement predict.py to load the trained Siamese Network model and pe…

2896300

…rform predictions on new sample data.

Integrated PCA-based visualization for model embeddings in train.py a…

561d8a3

…nd added functionality to save the 2D scatter plot as embeddings_pca.png.

Changed embedding plots from PCA to T-SNE

3c30104

Added functionality to extract and store embeddings for the entire da…

eda84c8

…taset in train.py This will enable subsequent classifier training on Siamese network embeddings.

Modified dataset.py to include labels, allowing for easier extraction…

0cc7e8d

… and use of labels when processing embeddings post Siamese network training.

Added SimpleClassifier class in the neural network module to facilita…

6a77f09

…te the classification of Siamese network embeddings into AD or NC categories

Implemented the training loop for the Simple Classifier using extract…

3e7fc30

…ed embeddings from the Siamese Network. Utilized the Adam optimizer and CrossEntropyLoss for training.

Added evaluation phase for the Simple Classifier using the test embed…

f0ba8da

…dings. The accuracy metric is computed to assess the classifier's performance on the test set.

Changed labels to detect if AD or NC in it's path.

5f9c135

Added method plot_confusion_matrix to plot confusion matrix so can vi…

2afb4e3

…sualise.

Changed train.py so code can print and out plot of train vs validatio…

3010854

…n losses.

tranvicky and others added 21 commits October 23, 2023 21:21

Added tracking and visualization for the classifier's training and va…

879bc08

…lidation losses. This enhancement aids in monitoring the classifier's performance across epochs.

Changed code to fix issue of tsne graph displaying only one colour

5f66d81

Add code for SNN to be stopped early if validation loss doesn't impro…

6f2924f

…ve for 5 consecutive epochs).

Added changed to train.py to resolve issue of incorrect indexing with…

c4a4810

…in classification evaluation.

Changed train.py code to save plot of siamese training vs losses.

2b5d722

Solved issues with pconfusion matrix plot not showing and issue with …

a9e5b91

…incorrect embedding sizing.

Introduced more convolutional layers and added dropout after each fc …

bc90ea3

…layer

Adjusted hyperparameters for learning rate and batch size.

a11b716

Updated README.md to fill out all of the uncompleted sections.

ca94001

Changed code to resolve dimension bugs after changed number of fc lay…

1af6bb3

…ers.

Updated README.d to include visualisations/plots produced during trai…

dbdaff3

…ning and also added results summary.

Updated README.md to reflect updated dependencies.

28c03b8

Updated predict.py so that if user wants to visualise the model, they…

bad03a6

… can now just run the file. Updated with saving all visualising plots.

Updated README.md to detail a more in-depth setup guide and running f…

bf8a1a4

…or users.

Updated train.py to resolve issue about numpy to tensor error.

53f5e2f

Updated references for README.md

1fb431b

Removed unecessary print statements from train.py, modules.py and dat…

5814d1b

…aset.py.

Provide folder for images/plots/visualisations needed for README.md.

8d8ad25

Update README.md to resolve issue of images only showing links instea…

77b19d5

…d of images and to display images properly.

Updated README.md and fixed image pathways.

fb57496

Changed variable outputs to resolve tensor error in train.py.

6aee14a

nathasha-naranpanawa added the Siamese label Oct 29, 2023

shakes76 added the Extension Extension approved label Nov 20, 2023

shakes76 added the question Further information is requested label Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Topic recognition - s4647936 #165

Topic recognition - s4647936 #165

tranvicky commented Oct 25, 2023

nathasha-naranpanawa commented Nov 6, 2023 •

edited

Loading

shakes76 commented Nov 20, 2023

shakes76 commented Nov 20, 2023

wangzhaomxy commented Nov 21, 2023

Topic recognition - s4647936 #165

Are you sure you want to change the base?

Topic recognition - s4647936 #165

Conversation

tranvicky commented Oct 25, 2023

nathasha-naranpanawa commented Nov 6, 2023 • edited Loading

shakes76 commented Nov 20, 2023

Marking

Good Practice (Design/Commenting, TF/Torch Usage)

Recognition Problem

Commit Log

Documentation

Pull Request

shakes76 commented Nov 20, 2023

wangzhaomxy commented Nov 21, 2023

nathasha-naranpanawa commented Nov 6, 2023 •

edited

Loading